### CS152: Computer Systems Architecture Circuits Recap – Digital Why And How Sang-Woo Jun Winter 2021 #### Course outline - Part 1: The Hardware-Software Interface - O What makes a 'good' processor? - Assembly programming and conventions - Part 2: Recap of digital design - Combinational and sequential circuits - How their restrictions influence processor design - ☐ Part 3: Computer Architecture - Computer Arithmetic - Simple and pipelined processors - Caches and the memory hierarchy - ☐ Part 4: Computer Systems - Operating systems, Virtual memory ### The digital abstraction "Building Digital Systems in an Analog World" ### The digital abstraction - ☐ Electrical signals in the real world is analog - Continuous signals in terms of voltage, current, - ☐ Modern computers represent and process information using discrete representations - Typically binary (bits) - Encoded using ranges of physical quantities (typically voltage) ### Aside: Historical analog computers - ☐ Computers based on analog principles have existed - Uses analog characteristics of capacitors, inductors, resistors, etc to model complex mathematical formulas - Very fast differential equation solutions! - Example: Solving circuit simulation would be very easy if we had the circuit and was measuring it - ☐ Some modern resurgence as well! - Research on sub-modules performing fast non-linear computation using analog circuitry Polish analog computer AKAT-1 (1959) Source: Topory Why are digital systems desirable? Hint: Noise ### Using voltage digitally - ☐ Key idea - Encode two symbols, "0" and "1" (1 bit) in an analog space - And use the same convention for every component and wire in system $V_L$ and $V_H$ are enforced during component design and manufacture Source: MIT 6.004 2019 L05 ### Handling noise - ☐ When a signal travels between two modules, there will be noise - o Temperature, electromagnetic fields, interaction with surrounding modules, ... - $\Box$ What if $V_{out}$ is barely lower than $V_L$ , or barely higher than $V_H$ ? - Noise may push the signal into invalid range - Rest of the system runs into undefined state! - ☐ Solution: Output signals use a stricter range than input ### Voltage Transfer Characteristic - ☐ Example component: Buffer - A simple digital device that copies its input value to its output - ☐ Voltage Transfer Characteristic (VTC): - Plot of V<sub>out</sub> vs. V<sub>in</sub> where each measurement is taken after any transients have died out. - Not a measure of circuit speed! - Only determines behavior under static input - ☐ Each component generates a new, "clean" signal! - Noise from previous component corrected "forbidden zone" ### Benefits of digital systems - ☐ Digital components are "restorative" - Noise is cancelled at each digital component - Very complex designs can be constructed on the abstraction of digital behavior - ☐ Compare to analog components - Noise is accumulated at each component - Lay example: Analog television signals! (Before 2000s) - Limitation in range, resolution due to transmission noise and noise accumulation - Contrary: digital signals use repeaters and buffers to maintain clean signals ## CS152: Computer Systems Architecture Digital Circuit Design Recap Sang-Woo Jun Winter 2021 ### Combinational and sequential circuits - ☐ Two types of digital circuits - Combinational circuit - Digital inputs Digital outputs - Output is a function of current input values - output = f(input) - Output depends exclusively on input - ☐ Sequential circuit - Have memory ("state") - Output depends on the "sequence" of past inputs #### What constitutes combinational circuits - 1. Input - 2. Output - 3. Functional specifications - The value of the output depending on the input - Defined in many ways! - Boolean logic, truth tables, hardware description languages, We've done this in CS151 - 4. Timing specifications Hinted at in CS151 - Given dynamic input, how does the output change over time? ### Timing specifications of combinational circuits - $\square$ Propagation delay $(t_{PD})$ - An upper bound on the delay from valid inputs to valid outputs - Restricts how fast input can be consumed (Too fast input → output cannot change in time, or undefined output) A good circuit has low t<sub>PD</sub> - → Faster input - → Higher performance How do we get low $t_{PD}$ ? ### Timing specifications of combinational circuits - $\Box$ Contamination delay ( $t_{CD}$ ) - A lower bound on the delay between input change to output starting to change - Does not mean output has stable value! - Guarantees that output will not change within this timeframe regardless of what happens to input No promises during XXXXXX ## The basic building block: CMOS transistors ("Complementary Metal-Oxide-Semiconductor") Everything is built as a network of transistors! ## The basic building block: CMOS FETs ☐ Remember CS151 – FETs come in two varieties, and are composed to create Boolean logic | Α | В | Z | |---|---|---| | 0 | 0 | 1 | | 0 | 1 | 1 | | 1 | 0 | 1 | | 1 | 1 | 0 | Making chips out of transistors...? Intel 4004 Schematics drawn by Lajos Kintli and Fred Huettig for the Intel 4004 50<sup>th</sup> anniversary project # The basic building block 2: Standard cell library - Standard cell - Group of transistor and interconnect structures that provides a boolean logic function - Inverter, buffer, AND, OR, XOR, ... - For a specific implementation technology/vendor/etc... - Also includes physical characteristic information - ☐ Eventually, chips designs are expressed as a group of standard cells networked via wires - Among what is sent to a fab plant Example: Various components have different delays and area! The actual numbers are not important right now | Gate | Delay<br>(ps) | Area<br>(μ²) | |----------|---------------|--------------| | Inverter | 20 | 10 | | Buffer | 40 | 20 | | AND2 | 50 | 25 | | NAND2 | 30 | 15 | | OR2 | 55 | 26 | | NOR2 | 35 | 16 | | AND4 | 90 | 40 | | NAND4 | 70 | 30 | | OR4 | 100 | 42 | | NOR4 | 80 | 32 | ### Back to propagation delay of combinational circuits - ☐ A chain of logic components has additive delay - The "depth" of combinational circuits is important - ☐ The "critical path" defines the overall propagation delay of a circuit Source: en:User:Cburnett @ Wikimedia #### Sequential circuits - ☐ Combinational circuits on their own are not very useful - ☐ Sequential logic has memory ("state") - State acts as input to internal combinational circuit - Subset of the combinational circuit output updates state #### Synchronous sequential circuits - "Synchronous": all operation are aligned to a shared clock signal - Speed of the circuit determined by the delay of its longest critical path - For correct operation, all paths must be shorter than clock speed - Either simplify logic, or reduce clock speed! ### Timing constraints of state elements - ☐ Synchronous state elements also add timing complexities - Beyond propagation delay and contamination delay - $\square$ Propagation delay $(t_{PD})$ of state elements - Rising edge of the clock to valid output from state element - $\Box$ Contamination delay (t<sub>CD</sub>) - State element output should not change for t<sub>CD</sub> after clock change - $\Box$ Setup time ( $t_{SETUP}$ ) - State element should have held correct data for t<sub>SETLIP</sub> before clock edge - $\Box$ Hold time (t<sub>HOLD</sub>) - Input to state element should hold correct data for t<sub>HOLD</sub> after clock edge #### Timing behavior of state elements - ☐ Meeting the <u>setup time</u> constraint - "Processing must fit in clock cycle" - After rising clock edge, - $\circ$ t<sub>PD</sub>(State element 1) + t<sub>PD</sub>(Combinational logic) + t<sub>SETUP</sub>(State element 2) - must be smaller than the clock period ...before the next clock #### Timing behavior of state elements - ☐ Meeting the <u>hold time</u> constraint - "Processing should not effect state too early" - After rising clock edge, - $\circ$ $t_{CD}(State element 1) + t_{CD}(Combinational logic) = Guaranteed time output will not change$ - must be larger than t<sub>HOLD</sub> (State element 2) ### Real-world implications - ☐ Constraints are met via Computer-Aided Design (CAD) tools - Cannot do by hand! - Given a high-level representation of function, CAD tools will try to create a physical circuit representation that meets all constraints - ☐ Rule of thumb: Meeting **hold time** is typically not difficult - e.g., Adding a bunch of buffers can add enough t<sub>CD</sub>(Sequential Circuit) - ☐ Rule of thumb: Meeting **setup time** is often difficult - Somehow construct shorter critical paths, or - reduce clock speed (We want to avoid this!) How do we create shorter critical paths for the same function? ### Simplified introduction to placement/routing - ☐ Mapping state elements and combinational circuits to limited chip space - Also done via CAD tools - May add significant propagation delay to combinational circuits - ☐ Example: - Complex combinational circuits 1 and 2 accessing state A - Spatial constraints push combinational circuit 4 far from state A - Path from B to A via 4 is now very long! - ☐ Rule of thumb: - One comb. should not access too many state - One state should not be used by too many comb. # Looking back: Why are register files small? ☐ Why are register files 32-element? Why not 1024 or more? Hierarchical design of a 8x1 multiplexer Propagation delay increases with more registers! ### Real-world example - ☐ Back in 2002 (When frequency scaling was going strong, but larger FETs) - Very high frequency (multi-GHz) meant: - ... setup time constraint could tolerate - ... up to 8 inverters in its critical path - Such stringent restrictions! Can we even fit a 32-bit adder there? No!